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SPECIFICATION 
Information processing System, Method of Processing 
Information, and Program for Processing Information 



5 Technical Field 

The present invention relates to an information processing 
system, a method of processing information , a program for processing 
information, and a robot. In particular, the invention relates to 
an information processing system, a method of processing information, 
10 a program for processing information, and a robot, wherein the above 
system, method, program and robot can add and delete information 
when transmitting the result of information processing. 



Background Art 

15 Patent Document 1 , Patent Document 2 , Patent Document 3 , Patent 

Document 4 , Patent Document 5 , Patent Document 6 and Patent Document 
7 disclose conventional information processing systems designed 
to perform interpretation or translation. 

The information processing system disclosed in Patent Document 

20 1 comprises a voice recognition unit, a machine translation unit, 
a voice synthesis unit, a display unit, and an interactive managing 
unit. The information processing system thus configured and 
disclosed in Patent Document 1 operates as will be described below. 

The voice recognition unit recognizes the voice inputted to 

25 it. The result of voice recognition is supplied to the machine 
translation unit. The machine translation unit performs machine 
translation on the result of voice recognition. If the result is 
not composed of data units fitted for machine translation , the machine 
translation unit instructs the voice recognition unit to change 
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the data units. Then, the voice synthesis unit outputs voice. 

The conventional information processing systems disclosed 
in Patent Documents 2 to 4, which are other examples of systems 
that are designed to perform interpretation or translation, have 
5 a voice recognition unit, a machine translation unit and a voice 
synthesis unit, as is disclosed in Patent Document 1. The 
conventional information processing systems disclosed in Patent 
Documents 1 to 4 are all designated to perform interpretation or 
translation. Their objective is to achieve highly-accurate and 
10 natural interpretation or translation. 

In discussing the prior art, Patent Document 5 refers to a 
robot that performs preset motions in accordance with the user's 
instructions or the external conditions. The document discloses 
another robot that comprises a means for recognizing the external 
15 conditions and a means for generating emotion from internal 

conditions. This robot reflects emotional changes on the dialog. 

Patent Document 6 describes a robot that controls characters 
or conditions displayed as images , in accordance with the information 
supplied from a voice recognition unit, a machine translation unit, 
20 a voice input unit or the like and representing the user's emotion. 
Patent Document 7 describes a robot that has a translation function, 
a foreign-language conversation function, a study function of 
determining whether the user's answers are correct, a 
pronunciation-evaluating function, and the like. 
25 Patent Document 1: JP-A-2001-100784 

Patent Document 2: JP-A-2001-100788 

Patent Document 3: JP-A-2001-117921 

Patent Document 4: JP-A-2001-117922 

Patent Document 5: JP-A-2002-283261 
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Patent Document 6: JP-A-2001-117752 
Patent Document 7: JP-A-2002-351305 

Description of Invention 
5 Problems to be Solved by the Invention 

Conventional techniques have been explained, with reference 

to Patent Documents 1 to 7 . The techniques disclosed in these 

documents may be used in appropriate combination, thereby to perform 

interpretation or translation at high accuracy, and various motions 
10 in accordance with signals supplied from external or with the internal 

conditions. However, the conventional techniques have the 

following problem. 

The problem is that the conventional systems exemplified above 

cannot reproduce anything other than the correct result of 
15 information processing. This is because they are configured to 

reproduce the result of information processing without modification, 

exactly as it is. 

Accordingly, an object of the present invention is to provide 

an information processing system, a method of processing information, 
20 a program for processing information, and a robot, which can reproduce 

not only the result of information processing, but also information 

other than the result. 

Means for Solving the Problem 

25 An information processing system according to this invention 

comprises an information analysis unit that analyzes received 
information and determines additional information to be added to 
the received information or deletion information to be deleted from 
the received information, on the basis of result of analyzing the 
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received information; a change processing unit that adds the 
additional information determined by the information analysis unit 
or deletes the deletion information determined by the information 
analysis unit , to or from the received information ; and an information 
5 reproducing unit that reproduces information which has been changed 
by the change processing unit. 

A robot according to this invention comprises an information 
analysis unit that analyzes received information and determines 
additional information to be added to the received information or 

10 deletion information to be deleted from the received information, 
on the basis of result of analyzing the received information; 

a change processing unit that adds the additional information 
determined by the information analysis unit or deletes the deletion 
information determined by the information analysis unit, to or from 

15 the received information; and 

an information reproducing unit that reproduces information 
which has been changed. 

An information processing system according to the present 
invention comprises an information processing device comprising 

20 an information analysis unit that analyzes received information 
and determines additional information to be added to the received 
information or deletion information to be deleted from the received 
information, on the basis of result of analyzing the received 
information, a change processing unit that adds the additional 

25 information determined by the information analysis unit or deletes 
the deletion information determined by the information analysis 
unit, to or from the received information, an information reproducing 
unit that reproduces information which the additional information 
has been added or the deletion information has been deleted, and 
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a first communication means for transmitting a retrieval instruction 
for retrieving the additional information determined by the 
information analysis unit; and 

an information storing device comprising a second 
5 communication means for transmitting and receiving information to 
and from the first communication means, a memory unit that stores 
information to be usable as the additional information, and a 
retrieval unit that retrieves the additional information from the 
memory unit in accordance with the retrieval instruction, 
10 wherein the additional information retrieved from the 

information storing device is transmitted to the change processing 
unit through the second communication means and the first 
communication means . 

A method of processing information, according to the present 
15 invention , comprises a first step of analyzing received information 
and determining additional information to be added to the received 
information or deletion information to be deleted from the received 
information, on the basis of result of analyzing the received 
information; and 

20 a second step of adding the additional information determined 

in the first step or deleting the deletion information determined 
in the first step, to or from the received information. 

A method of processing information, according to the present 
invention, comprises a first step of analyzing received information 
25 and determining information to be added to the received information, 
on the basis of result of analyzing the received information; 

a third step of transmitting an instruction for retrieving 
the added information; 

a fourth step of retrieving the added information from a memory 
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unit that stores information to be usable as the added information, 
in accordance with the instruction for retrieving the added 
information ; 

a fifth step of transmitting the added information retrieved; 
5 a sixth step of adding the information to be added, to the 

received information; and 

a seventh step of reproducing the information to which the 
information to be added has been added. 

A program for processing information , according to the present 
10 invention, is configured to cause a computer to perform: 

a first process of analyzing received information and 
determining additional information to be added to the received 
information, or deletion information to be deleted from, the received 
information, on the basis of result of analyzing the received 
15 information; and 

a second process of adding the additional information 
determined in the first process or deleting the deletion information 
determined in the first process , to or from the received information . 

2 0 Effect of the Invention 

The present invention helps to achieve smooth communication . 
This is because it can transmit additional information, as well 
as the result of information processing. Further, the present 
invention can accomplish communication at high efficiency. This 
25 is because it can delete unnecessary information or redundant 

information from the result of information processing. Moreover, 
the present invention can enhance the quality of communication. 
This is because it can add appropriate detail information to the 
result of information processing. 
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Best: Mode for Carrying Out the Invention 

Embodiment 1 

Best modes for carrying out the present invention will be 
5 described in detail, with reference to the accompanying drawings. 
As shown in FIG. 1, the first embodiment of this invention includes 
a sensor 2 that serves as an information input unit, an information 
processing means 3, an information changing means 5, and an 
information reproducing means 4. These means operate, generally 

10 as follows. 

The sensor 2 as the information input unit receives information 
to be processed, such as signals supplied from outside. The sensor 
2 can be configured to receive all signals such as electric waves, 
light, sound, temperature (thermography) , contact stimulation, and 

15 any possible combination of these. As sensor 2, an acoustic 

transducer, for example, microphone, a CCD camera, and a keyboard 
can be exemplified. The information processing means 3 processes 
the information that the sensor 2 has received, and transmits the 
result of the information processing to the information reproducing 

2 0 means 4. The information processing may be interpretation that 
includes voice recognition, translation and voice synthesis. In 
this case, the information processing means 3 translates the first 
language received by the microphone or keyboard as sensor 2 , into 
the second language . The information changing means 5 adds 

25 information to or delete information from, the result of the 

inf ormationprocessingperf ormedby the inf ormationprocessingmeans 
3. Then, the information is reproduced in the information 
reproducing means 4 . The signal that the information reproducing 
means 4 reproduces includes sound, images (still pictures and moving 
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pictures) , and characters. The information reproducing means 4 may 
be a speaker or a display device such as a liquid crystal display. 

FIG. 2 shows an example of the information processing means 
3, which performs interpretation. This information processing 
5 means 3 comprises a control unit 31, a voice processing unit 32 , 
and an automatic interpretation unit 33 . The voice processing unit 
32 includes a voice recognition unit 321 and a voice synthesis unit 
322. The operation of the voice recognition unit is detailed in, 
for example , Akio Ando , "Real Time Speech Recognition , " The Institute 

10 of Electronics, Information and Communication Engineers , September 
2003 (Document A) . The operation of the voice synthesis unit is 
detailed in, for example, Sadaoki Furui, "Digital Speech 
Processing,'' Tokai University Press, September 1985 (Document B) . 

The control unit 31 controls the operation of the entire system. 

15 It manages and controls the direction of translation between two 
languages, the flow of various information, the addresses of these 
information, and all other things. The voice recognition unit 321 
recognizes the voice inputted to it, on the basis of the attributes 
of the voice. The voice synthesis unit 322 converts the output of 

20 the automatic interpretation unit 33 into voice. The automatic 
interpretation unit 33 receives the speech of a person from the 
voice recognition unit 321 and translates the speech to one in another 
language. The content of the speech sent from the voice recognition 
unit 321 is expressed in one of two languages. Thus, the automatic 

25 interpretation unit 33 has the function of performing interactive 
automatic interpretation . 

FIG. 3 depicts another example of the information processing 
means 3, which operates to achieve multi-lingual interpretation. 
This information processing means 3 includes a control unit 31, 
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a voice processing unit 32 , and an automatic interpretation unit 
33. The combination of the voice processing unit 32 and automatic 
interpretation unit 33 can deal with more languages than the languages 
subjected to interpretation. 
5 The control unit 3 1 controls the operation of the entire system . 

It manages and controls the direction of translation between many 
languages, the flow of various information, the addresses of these 
information, the selection of an appropriate combination of the 
voice processing unit 32 and automatic interpretation unit 33, and 
10 all other things . The other operations of the information processing 
means 3 are identical to those explained with reference to FIG. 
2. 

FIG. 4 shows an example of the information changing means 
5 , which adds information . The information changing means 5 shown 
15 in FIG. 4 includes a change processing unit 51, an 

additional-information generating unit 52, and a control unit 53. 
The additional-information generating unit 52 includes an 
information analysis unit 521, a retrieval unit 522, and a memory 
523. 

20 The change processing unit 51 receives the result of 

information processing from the information processing means 3 and 
actually adds information. The additional-information generating 
unit 52 receives the result of information processing from the 
information processing means 3 via the change processing unit 51 . 

25 The additional-information generating unit 52 then generates 

information that the change processing unit 51 will add. The control 
unit 53 controls the generation of additional information, which 
is performed in the additional-information generating unit 52, and 
the addition of information, which is carried out in the 
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change-processing unit 51. The information analysis unit 521 
analyzes the information, i.e., the result of information processing, 
which has been received from the information processing 3 via the 
retrieval unit 522. The information analysis unit 521 then 
5 determines which information should be added. The analysis is 
composed of a step of classifying the information supplied, a step 
of selecting a category for the additional information, and a step 
of selecting the additional information. 

In the step of classifying the information supplied, the 

10 meaning of the information is interpreted and the information is 
classified as a question, an explanation, or the like. If the 
information supplied is a sentence such as "Is it ... ?", "Whether 
... is questionable ..." or the like, it is classified as a question. 
If it is a sentence such as "This point is explained as being 

15 "It is considered to be ..." or the like, it is classified as an 
explanation . 

In the step of selecting a category for the additional 
information, a category of the additional information is selected, 
which is suitable as a question or an explanation. For example, 

20 an expression prompting an answer to a question can be added to 
a question, and an expression confirming understanding or 
communication can be added to an explanation. The expression 
prompting an answer is, for example, "Please respond to ..." . The 
expression confirming understanding or communication is , for example , 

25 "Do you understand?" or "Do you hear it?". 

In the step of selecting the additional information, one or 
more information are selected from those that belong to these 
categories. The information selected will be used as additional 
information. The information belonging to these categories have 
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been stored into the memory 523 . To select additional information, 
keywords may be extracted from the result of information processing 
and the information stored. The additional information can then 
be selected on the basis of the relation between the keywords thus 
5 extracted. To this end, the additional information which are stored 
into the memory 523 should be classified keyword by keyword. If 
there is a plurality of additional information that can be identified 
by keyword, they can be used in the order they are stored in the 
memory 523 or in the reverse order. Further, additional information 

10 used in the past can be used again after some time from time that 
they were used. Moreover, random additional information may be added 
to the keywords extracted from the result of information processing . 
In this case, random numbers are generated, and additional 
information items that correspond to the random numbers are selected . 

15 Random words may induce unpredictable friendly reactions. In view 
of this, the additional information brings forth a great advantage. 

In other classification, the result of information may be 
classified into a rapid conversation that consists mainly of short 
phrases and sentences, and a relatively formal conversation that 

20 consists of long sentences, through the analysis of the past result 
of information processing and the additional information. In the 
step of selecting the category for the additional information for 
a rapid conversation, short phrases such as shouts and interjected 
chants may be used as additional information because they do not 

25 disturb conversation. In the step of selecting the category for 
the additional information for a formal conversation, long phrases 
and sentences may be used as additional information. "Way to go, 
man!" and "Bravo!" are examples of shouts and interjected chants. 
The time interval from the transmission of the result of 
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information processing to the information reproducing apparatus, 
to the acquisition of the result of the next information processing 
may be evaluated, thereby to estimate the understanding of the other 
party. When the time interval is evaluated, it may be determined 
5 that the response of the other party is delayed. If this is the 
case , an expression prompting an answer or an expression of agreement 
can be added. Example of an expression of agreement are "Exactly , " , 
"You bet!" or "You got it!", "I see." and "I agree.". 

Phrases expressing encouragement or consolation may be added 

10 for a person who feels something upon hearing the interpretation 
result. In this case, the reaction to the information supplied is 
analyzed in the step of classifying the information. The reaction 
is then classified in order to select a category for the additional 
information. If the reaction is classified to one indicating, for 

15 example, a sorrow or a joy, the category of consolation or sympathy 
is selected. 

The retrieval unit 522 retrieves from the memory 523 the 
additional information determined by the information analysis unit 
521. The memory 523 has stored information that may be used as 
20 additional information. 

The additional information that the retrieval unit 522 has 
retrieved is supplied to the change processing unit 51. Examples 
of the information stored in the memory are expressions showing 
interpretation or translation, expressions prompting an answer, 
25 expressions indicating understanding or communication, shouts, 
interjected chants, expressions of agreement, and the like. Among 
the expressions showing that the statement is an interpreted or 
translated one are " so he says", "he is asking you .... What would 
you do?", "he is asking about ..." , and so on. 



- 13 - 



If the information reproducing means 4 is a display device 
or a robot, information other than the linguistic information 
explained thus far can be added. Examples of such information are 
those that represent motions , colors and light . If a display device 
5 is displaying an agent or a robot is used, the motion of the agent 
or robot makes will be the additional information. The agent or 
the robot can make not only motions that man usually does, such 
as nodding, leaning of head, wagging of head and jumping, but also 
motions that man would never make. Examples of the special motions 

10 of the agent or robot can make are the expansion and contraction 
of body and separation of body. Further, the agent can make its 
copies in large or small numbers . In the robot , parts of body , which 
move to make motions, are the arms, the hands, the legs, the trunk, 
the head, the face, and the like. 

15 Color is used as additional information in the case where 

the agent or the robot needs to change in color. For example, the 
color of the agent may be changed, or the light emitted from the 
built-in luminous bodies of varied positions of the agent or robot 
may be changed in color. As such luminous bodies, LEDs can be used, 

20 in addition to electric bulbs of ordinary type. If a plurality of 
luminous bodies are controlled in complex combinations, the agent 
or the robot can look as if it were expressing its emotion. This 
enhances the advantage of using the additional information in the 
present invention . 

25 Another example of using light as additional information is 

a control of a blinking pattern or intensity of light. If the luminous 
bodies are controlled to blink in a specific pattern or to emit 
light of various intensities, the same advantage will be attained 
as in the case where the color is changed as described above. 
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FIG. 5 shows another embodiment of the invention, in which 
the information changing means 5 operates to add information. The 
. additional-information generating unit 52 shown in FIG. 5 includes 
a communication means 524 that is used in place of the memory 523 
5 shown in FIG. 4. It further includes an additional-information 
storage unit 54. That is, the additional-information generating 
unit 52 acquires the additional information from the 
additional-information storage unit 54 via the communication means 
524, in place of the information added by the change processing 
10 unit 51 and stored into the memory 523 . The additional-information 
storage unit 54 includes a communication-processing unit 541, a 
retrieval unit 542, memory 543, and a control unit 544. 

The communication means 541 is paired with, and communicates 
with, the communication means 524. The communication means 541 
15 receives the information that the retrieval unit 522 should retrieve . 
The information to be retrieved, which the communication means 541 
has received, is transmitted to the retrieve unit 542 . The retrieval 
unit 542 retrieves necessary information from the memory 543 and 
transmits it to the retrieval unit 522 via the communication means 
20 541 and the communication means 524. This sequence of operations 
is controlled by the control unit 544 and the control unit 53. 

The memory 543 previously stores information that may be used 
as additional information. The method and means that the 
communication means 541 and the communication means 524 employ to 
25 exchange information about retrieval can be any means available, 
such as wired (cable) transmission, wireless (radio) transmission 
including wireless LAN, and IP communication via the Internet. 

The additional-information storage unit' 54 may be a server 
that is connected via the Internet. Assume that the result of 
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interpretation concerns with, for example, the weather for tomorrow , 
the weather forecast retrieved via the Internet may be added to 
the result of interpretation. Then, the information processed can 
be increased in quality. 
5 The embodiment in which the information changing means 5 adds 

information has been described with reference to FIGS. 4 and 5. 
An embodiment, in which the information changing means 5 operates 
to delete information , can be similarly constructed, too. At first, 
in FIGS. 4 and 5, the additional-information generating unit and 

10 the additional-information storage unit are replaced by an 

information deleting unit and a deleted-inf ormation storage unit, 
respectively. The information analysis unit analyzes the 
information supplied to it and determines which information should 
be deleted, not which information should be added. 

15 The information processing system according to the present 

embodiment is implemented in the form of a portable device such 
as a personal digital assistant (PDA) , a personal computer or a 
cellular telephone. Its components are arranged and combined into 
a single terminal that is portable. The user can therefore use the 

20 system anywhere to make smooth communication with people. 

In the present embodiment, the information changing means 
5 adds information to, or delete information from, the result of 
information processing, which has been obtained in the information 
processing means 3, as has been described above. Hence, an added 

25 value such as emotional expression or detailed information can add 
to the result of information processing performed correctly. This 
helps to accomplish smooth communication. 

The information processing means may perform interpretation 
or translation. In this case, phrases appropriate to the result 
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of interpretation or translation is added. These phrases make the 
communicating partner feel congenial with the user. This 
contributes to smooth communication . Alternatively , it is possible 
to increase the quality of communication by appropriately 
5 summarizing the result of interpretation or translation or by adding 
detailed information . 
Embodiment 2 

This embodiment is identical to Embodiment 1, except that 
. the construction is functionally divided into a terminal 21 and 

10 a server 40 , which are connected by a communication path 80. 

As shown in FIG. 6, Embodiment 2 of the present invention 
includes a terminal 21, a server 40, and a communication path 80. 
The terminal 21 includes a sensor 2, a communication means 211, 
an information changing means 5, and an information reproducing 

15 means 4. The server 40 includes an information processing means 
3, an information changing means 401, and a communication means 
402. The operations of the sensor 2, information processing means 
3 and information reproducing means 4 are the same operations that 
have been already described. The information changing means 205 

2 0 and 401 operate in the same way as the information changing means 
5. These means operate, generally as will be explained below. 

The sensor 2 receives information to be processed, such as 
signals supplied from outside . The communication means 211 supplies 
the information received at the sensor 2 , to the communication means 

25 402 through the communication path 80 . The communication means 402 
supplies the information received at the sensor 2 to the information 
processing means 3 through the communication means 211. The 
information processing means 3 processes the information received 
at the sensor 2 . The information processed is supplied to the 
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information changing means 401 . The information changing means 401 
adds information to, or deletes information from, the result of 
information processing, which has been obtained by the information 
processing means 3. The information changing means 401 then 
5 transmits the resulting information to the communication means 402 . 

The communication means 402 again transmits the result of 
information processing, which has been changed, to the communication 
means 211 via the communication path 80. The communication means 
211 transmits the result of processing, which it has received via 

10 the communication means 402, to the information changing means 205. 
The information changing means 205 adds information to, or deletes 
information from, the result of processing, which it has received 
through the communication means 402, and transmits the resulting 
information to the information reproducing means 4 . 

15 In the sequence of processes , described above , the information 

changing means 401 and the information changing means 205 deletes 
or adds information. Nonetheless, one of the information changing 
means 401 and information changing means 205 may be configured to 
delete or add information . For example , the additional-information 

20 generating unit 52 that is shown in FIG. 4 may be set into the 
information changing means 401, and the change-processing unit 51 
that supplies the additional information that has been generated 
by the additional-information generating unit 52 may be set into 
the information changing means 205. 

2 5 The terminal 21 is implemented as a portable device such as 

a personal digital assistant (PDA) , a personal computer or a cellular 
telephone. No restriction is imposed on the implementation of the 
server 40. The communication path 80 can be any means available, 
such as wired (cable) transmission, wireless (radio) transmission 



including wireless LAN, and IP communication via the Internet. 

The process that the terminal 21 and server 40 perform may 
be interpretation. In this case, the sensor 2, information 
processingmeans 3 and information reproducingmeans 4 can be assigned 
to voice recognition , translation and voice synthesis . For example , 
the information processing means 3 may perform voice recognition 
and translation and the information reproducing means 4 may perform 
voice synthesis. If so, the information transmitted from the 
terminal 21 to the server 40 via the communication path 80 is audio 
information, whereas the information transmitted from the server 
to the terminal 21 is the result of translation. 

The sensor 2 may extract feature quantity of the audio 
information, such as cepstrum, the information processing means 
3 may carry out voice recognition, translation and generation of 
phoneme information, and the information reproducing means 4 may 
synthesize from the phonemes into voice. In this case, the 
information transmitted from the terminal 21 to the server 40 via 
the communication path 80 represents the feature quantity of voice 
and the information transmitted from the server to the terminal 
21 is phoneme information. 

Thus, the sensor 2, information processing means 3 and 
information reproducing means 4 can be assigned to the parts of 
information processing. 

The method and means that enable the communication means 211 
and the communication means 402 to exchange information can be any 
means available, such as wired transmission, wireless transmission 
including wireless LAN, and IP communication via the Internet. 

In FIG. 6, one terminal 21 and one server 40 are illustrated. 
Nevertheless , one terminal and a plurality of severs may be connected 
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by a network, a plurality of terminals and one server may be connected 
by a network or a plurality of terminals and a plurality of servers 
may be corrected by a network . The network can be any means available , 
such as cable transmission , wireless transmission including wireless 
5 LAN, and IP communication via the Internet. 

If a plurality of servers are connected to one terminal by 
a network, the information from the sensor is supplied to the servers 
via the communication means, and the terminal can receive almost 
the same result of processing from the servers. In the terminal, 

10 the results of processing, received from the servers, are compared. 
The most appropriate result of processing can therefore be selected. 
Alternatively, the results of processing, supplied from the servers , 
may be combined in an appropriate manner, thereby to generate a 
better result of processing. Servers can be provided in any number 

15 desired. Consider an embodiment that uses , for example , one terminal 
and ten servers. In this embodiment, ten similar results that the 
ten servers have generated are supplied to the terminal. 

If the servers perform voice recognition or interpretation, 
the selection or combination means provided in the terminal compares 

20 the ten results of voice recognition or interpretation and selects 
the majority prominent result as result that should be transmitted 
to the information changing means . 

In the present embodiment, the information changing means 
401 and the information changing means 205 are configured to add 

25 information to , or delete information from, the result of processing, 
obtained in the information processing means 3. A value such as 
emotional expression or detailed information can therefore be added 
to the result of the processing that has been correctly carried 
out. This helps to accomplish smooth communication. Moreover, in 
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the present embodiment, the sever 40 is assigned to a process that 
includes many operations, and the terminal 21 is assigned to a 
small-scale process and interface operations. The processing can 
therefore be performed at high efficiency in order to achieve smooth 
5 communication. 

In the present embodiment, one or more terminals compare the 
results of processing, obtained by a plurality of servers, and then 
select the most appropriate result or combine some results to generate 
an appropriate result. Hence, the embodiment can achieve smooth 
10 high-quality communication. 
Embodiment 3 

In this embodiment, software implements the functions of the 
information processing means 3 and information changing means 5. 
As shown in FIG. 7, Embodiment 3 includes a computer (i.e. , central 

15 processing unit, processor, and data-processing device) 200, a 
sensor 2 , and an information reproducingmeans 4 . The computer (i.e. , 
central processing unit, processor, and information processing 
device) 900 includes the information processing means 3, and the 
information changing means 5. The computer 900 is composed of, for 

20 example, a CPU that performs operations, a memory unit (e.g. , ROM) 
that stores programs to be executed to perform the functions of 
the information processing means 3 and information changing means 
5, and another memory unit (e.g. , RAM) that stores the information 
supplied from the sensor 2 or the information resulting from the 

25 operations . The computer is configured in the form of, for example, 
a one-chip computer. These means operate, generally as will be 
described below. 

The sensor 2 receives information to be processed, such as 
signals supplied from outside. The information processing means 
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3 processes the information received at the sensor 2 . The information 
processed is supplied to the information reproducing means 4. The 
information changing means 5 adds information to, or delete 
information from, the result of information processing, which has 
5 been obtained in the information processingmeans 3 . The information 
reproducing means 4 reproduces the result of processing, to or from 
which the information changing means 5 has added or deleted 
information . 

FIG. 8 depicts another configuration in which software 

10 implements the functions of the information processing means 3 and 
information changing means 5. As shown in FIG. 8, the main program 
(i.e., information processing program) is read into the computer 
(central processing unit, processor, data processing device) 910 
and controls the operation of the computer 910. The computer 910 

15 is composed of , for example, a CPU that performs operations , a memory 
unit (e.g. , disk drive such as a hard disk drive) that stores programs 
to be executed to perform the functions of the information processing 
means 3 and information changing means 5, and another memory unit 
(e.g., RAM) that stores the information supplied from the sensor 

2 0 2 or the information resulting from the operations. The programs 
to be executed to perform the functions of the information processing 
means 3 and information changing means 5 are installed as needed. 
Thus, the software can be updated. Controlled by the programs, the 
computer 910 performs a processing that is similar to the processing 

25 that the computer 900 performs as has been explained with reference 
to FIG. 7 . 

In the present embodiment, information changing means 5 adds 
information to , or delete information from, the result of information 
processing, which has been obtained in the information processing 



- 22 - 



means 3 . A value such as emotional expression or detailed information 
can therefore be added to the result of the processing that has 
been correctly carried out. This helps to accomplish smooth 
communication . 
5 Embodiment 4 

A robot that uses the information processing system described 
with reference to FIGS. 1 to 5 will be explained. As shown in FIG. 
9, the second embodiment of this invention includes a robot body 
1, a sensor 2, an information processing means 3, an information 

10 changing means 5, an information reproducing means 4, an overall 
control unit 6 , a motion-controlling means 7 , a direction-inputting 
means 10, and a transmitting means 11. The overall control unit 
6 includes an evaluating means 62 and a timing-generating means 
63 . The motion-controlling means 7 includes a motor 71 and a memory 

15 72. The motor 71 is provided to rotate the robot body 1 and moves 
the same up and down. The robot may have arms and legs, looking 
like a man, may be shaped like an animal such as a dog or a cat, 
may have a display device, a microphone and a speaker, looking like 
a machine, or may be provided in other various forms. These means 

20 operate, generally as will be described below. 

The sensor 2, the information processing means 3, the 
information reproducing means 4 , and the information changing mans 
5 operate in the same manner as has been explained. The overall 
control unit 6 uses an evaluating means 62, evaluating the result 

25 of processing, which has been obtained in the information changing 
means 5, and transmits a control signal based on the result of 
evaluation, to the motion-controlling means 7 . The overall control 
unit 6 uses a timing-generating means 63, generating a timing signal 
representing the timing of operating the motion-controlling means 
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7. The timing signal is based on the result of processing, changed 
by the information changing means 5, and the output of the sensor 
2. The overall control unit 6 may use the evaluating means 62, 
evaluating the result of processing, obtained in the information 
5 processing means 3, and may transmit a control signal based on the 
result of evaluation, to the motion-controlling means 7. 

As illustrated in FIG. 9, the evaluating means 62 may evaluate 
the result of changing the information, obtained by the information 
changing means 5, when the information changing means 5 operates 

10 and adds, for example, a joke much related to the result of 

interpretation. In this case, the evaluating means 62 generates 
a control signal that corresponds to the content of that joke. For 
instance, the motion-controlling means 7 can generate a signal that 
causes the robot body 1 to turn to the left and the right, as if 

15 responding to the joke. Further, the motion-controlling means 7 
can generate a signal that causes the robot to make a gesture in 
response to the information reproduced by the information 
reproducing means 4, if the robot can make complex movements. 

On the other hand, the evaluating means 62 may evaluate the 

20 result of processing, obtained in the information processing means 
3 which has performed, for example, interpretation. In this case, 
the evaluating means 61 evaluates the result of the interpretation 
and generates a control signal corresponding to the content of 
interpretation. Assume that the result of interpretation contains 

25 a "laugh." Then, the motion controllingmeans 7 can generate a signal 
that causes the robot body 1 to move up and down, or from right 
to left twitchily. Such a motion is similar to the motion the man 
makes when laughing. Thus, the robot can express a friendly 
character . 
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Usually, the evaluating means 62 gives the motion-controlling 
means 7 one control signal upon evaluating the result of processing, 
obtained by the information changing means 5, and another signal 
upon evaluating the result of processing, obtained by the information 
5 processingmeans 3 . For example, even if the result of interpretation 
contains a "laugh," the evaluating means 62 generates no signals 
that cause the motion controlling means 7 to make the robot body 
1 move up and down, or from right to left twitchily, once the 
information changing means 5 has deleted the "laugh" from the result 

10 of interpretation. 

The motion controlling means 7 receives a control signal from 
the evaluating means 61 at the timing represented by the signal 
transmitted from the timing generating means 63 incorporated in 
the overall control unit 6 . The motion controlling means 7 converts 

15 the control signal to a signal for controlling the motor that actually 
moves the robot body 1. The signal is transmitted to the motor 71. 
The motor 71 generates a rotation or an up-and-down movement in 
accordance with the control signal thus supplied. 

The timing generating means 63 can generate timing signal 

2 0 from all information input to it. The input information includes 
acoustic information, video information, touch information, and 
the like . The acoustic information may represent the voice intensity, 
the direction in which voice is coming, the kind of language, words, 
combinations of words, address, name, personal data, and the like. 

25 Moreover, the acoustic information includes data about the changes 
in these items, such as a change in voice intensity and a change 
from one language to another. These information items can be stored 
in the memory in advance. The motion controlling means 7 doesn't 
necessarily need a timing generating means, if the operation timing 
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of the control signal transmitted from the evaluating means 62 is 
not controversial . 

In the present embodiment, the timing generating means 63 
generates a timing signal on the basis of the output of the sensor 
5 2 and the output of the information changing means 5. Instead, the 
means 63 may generate a timing signal when sensors detect external 
stimuli such as electric waves, light and sound. If this is the 
case, means for transmitting the external stimuli, such as electric 
waves, light and sound, are provided outside , and the external stimuli 

10 are received by an electric-wave receiver, a photoelectric 

transducer, a microphone and the like , which generate timing signals . 
These transmission means may be a remote controller, a personal 
digital assistant (PDA) , a personal computer (PC) and a cellular 
telephone. If the sensor that detects an external stimulus is a 

15 touch sensor, timing information can be transmitted when something 
contacts the touch sensor . A keyboard and a voice recognition device 
may be used as sensors that detect external stimuli. If so, the 
meaning of the information input at the keyboard, the result of 
recognition, and the timing of generating an input can be used as 

20 timing information that should be input. With this configuration, 
the motion can be controlled in accordance with any timing input 
from outside. Hence, smooth communication can be accomplished, 
through various expressions. 

Note that the motion controlling means 7 can have a plurality 

25 of motors. In this case, the motion controlling means 7 can realize 
complex motions, each being a combination of rotations around many 
axes. The motion controlling means 7 may be mechanically more 
complicated and may therefore control more complex motions. 

The motion controlling means 7 controls motions by using the 
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information transmitted from the direction inputting means 10, as 
well as the control signal and the timing information that the 
evaluating means 62 and the timing generating means 63 have 
respectively generated. For example, the evaluating means 62 
5 detects the information about the language, which is contained in 
the output of the information changing means 5 . The motion 
controlling means 7 makes the robot to turn to a listener, by using 
the information about the language and the direction information 
input from the direction inputting means 10 and concerning the 

10 listener listening to what is spoken in that language . The direction 
inputting means 10 receives the direction information about both 
the speaker and the listener and transmits to the motion controlling 
means 7 a control signal that corresponds to the direction information . 
The input to the direction inputting means 10 may be electric waves, 

15 light, sound, or the like. If the direction inputting means 10 is 
a touch sensor, the direction information can be input in the form 
of a contact stimulus or a combination of contact stimuli. Like 
the information stored in the memory 73, the direction information 
that the direction inputting means 10 receives can be expanded in 

20 number of items if multi-lingual interpretation is performed and 
there are many speakers and many listeners. 

The transmitting means 11 generates direction information, 
which the direction inputting means 10 will receive. The 
transmitting means 11 and the direction input means 10 are configured 

25 to handle the same information (i.e. , stimulus) . The transmitting 
means 11 is, for example, a remote controller, a PDA, a PC or a 
cellular telephone. The transmitting means 11 may be eliminated. 

Likewise, the input to the information processing means 3 
may be supplied to the evaluating means 62, instead of the output 
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from the information changing means 5, and the information about 
the language is detected. Then, the motion controlling means 7 can 
perform a control, making the robot turn to the speaker. Further, 
the motion controlling means 7 can make the speaker turn to the 
5 listener immediately after it has stopped speaking, if the timing 
generating means 63 detects the end of the speaker's speech. 

The relation between the control signal supplied from the 
evaluating means 61 and the specific operation of the motion 
controlling means 7 can be preset. The specific operation includes 

10 the operation per se and the timing of the operation. This relation 
is stored in the memory beforehand. Upon receiving the control signal 
from the evaluating means 61, the motion controlling means 7 refers 
to the content of the memory 72 and converts the control signal 
to a signal that controls the motor 71. 

15 The motion controlling means 7 may have another memory in 

addition to the memory 72 . The other memory stores the information 
about the direction in which a subject speaking the first language 
to be interpreted exists and the direction in which another subject 
speaking the second language into which the first language is 

2 0 interpreted exists. The other memory can therefore function as the 
direction inputting means . Hereinafter, the subject who speaks the 
first language will be called speaker , while the subj ect who receives 
the second language will be called listener. The speaker and the 
listener include inanimate obj ects such as robots . If multi-lingual 

2 5 interpretation is performed and there are many speakers and many 
listeners, the information about the direction in which the other 
memory stores data is expanded in accordance with how many languages 
are used and how many speakers and listeners involve. 

The motion controlling means 7 controls the motions by using 
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the information stored in the other memory, as well as the control 
signal and the timing information that the evaluating means 62 and 
the timing generating means 63 have respectively generated. For 
example, the evaluating means 62 detects the information about the 
5 language , which is contained in the output of the information changing 
means 5. The motion controlling means 7 makes the robot to turn 
to a listener, by using the information about the language and the 
direction information stored in the other memory and concerning 
the listener listening to what is spoken in that language. 

10 Likewise, the input to the information processing means 3 

may be supplied to the evaluating means 62, instead of the output 
from the information changing means 5, and the information about 
the language is detected. Then, the motion controlling means 7 can 
perform a control, making the robot turn to the speaker. Moreover, 

15 the motion controlling means 7 can make the speaker turn to the 
listener right after it has stopped speaking, if the timing generating 
means 63 detects the end of the speaker's speech. 

If the direction information is stored and used to control 
the motion, smooth communication can be accomplished, through 

20 various expressions. 

In the present embodiment, information changing means 5 adds 
information to , or delete information from, the result of processing , 
which has been obtained by the information processing means 3. A 
value such as emotional expression or detailed information can 

25 therefore be added to the result of the processing that has been 
correctly carried out. This helps to accomplish smooth 
communication. Furthermore, the present embodiment can accomplish 
smooth communication through various expressions , because themotion 
is controlled in accordance with the result of processing or the 



- 29 - 



information that has been changed. Still further, since a pattern 
of motion is stored, various expressions can be represented by 
processing a small amount of calculation. 

In the present embodiment, the motion is controlled in 
5 accordance with the timing when the input and the result of processing 
are obtained. The embodiment can therefore accomplish smooth 
communication through various expressions. Since any direction 
information can be input from outside, smooth communication through 
various expressions can be performed. Further, since information 
10 of a specific format can be transmitted in this embodiment when 
the direction information is input from outside, smooth 
communication through various expressions can be carried out. 
Embodiment 5 

This embodiment is a robot of another configuration. 

15 As is illustrated in FIG. 10, Embodiment 5 includes a sensor 

2, an information processing means 3, an information changing means 
5, an information reproducing means 4, an overall control unit 6, 
an motion-controlling means 7, a sensor 12, a direction-identifying 
means 13, a transmitting means 14, and a sensor 15. The overall 

20 control unit 6 includes an evaluating means 62, a timing generating 
means 64, and an analyzing means 66. The motion controlling means 
7 includes motors 71 and 171 and wheels 18 and 19. The direction 
identifying means 13 includes a selecting means 131, an identifying 
means 132, and a repeating control means 133. These means operate, 

25 generally as will be described below. 

The sensor 2, the information processing means 3, the 
information reproducing means 4, the information changing mans 5, 
the evaluating means 62, timing-generating means 63, and the motor 
71 operate in the same manner as has been explained. 
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The sensor 12 receives information about the direction of 
the speaker and the listener and supplies this information to the 
direction identifying means 13. The direction identifying means 
13 uses the information received from the sensor 12 , thus identifying 
5 the directions in which the speaker and listener exist , and transmits 
to the motion controlling means 7 a control signal that corresponds 
to the directions identified. If multi-lingual interpretation is 
performed and there are many speakers and many listeners, the 
directions identified by the direction identifying means 13 can 

10 be expanded to a plurality of directions in accordance with how 
many languages are used and how many speakers and listeners exist. 

Assume that the information about the direction of the speaker 
and listener, which the sensor 12 has received, is audio information . 
Then, the sensor 12 is a plurality of acoustic transducer. The 

15 present embodiment will be described, on the assumption that the 
sensor 12 is a plurality of microphones because the microphone is 
a representative example of an acoustic transducer. As is known 
in the art, the direction in which an acoustic signal comes can 
be identified by using the phase difference between the signals 

20 that the microphones receive. 

Ohga, Yamazaki, Kaneda , "Acoustic Systems and Digital 
Processing," The Institute of Electronics, Information and 
Communication Engineers, September 1995 (Document C) discloses a 
plurality of microphones that are arranged in a straight line. No 

25 phase difference exits between the signals received by the 

microphones and coming in directions at right angles to the straight 
line connecting the microphones . On the other hand, signals coming 
in any directions other than the directions at right angles to the 
straight line have phase difference between them. This is because 
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the signals reach the microphones at different times. 

This phase difference, i.e., time difference, is uniquely 
determined by the sonic speed, the interval between the microphones 
and the angle between the signal-coming direction and the straight 
5 line connecting the microphones . Hence, the direction in which the 
signal comes can be identified by evaluating the phase difference 
between the signals that the microphones have received. This means 
that the direction of the speaker can be identified by using the 
speaker ' s speech . 

10 The relation between the speaker and the listener repeatedly 

changes in the scenario of interpretation . Thus , once the direction 
of the speaker is identified at a certain time, the direction of 
the listener can be identified when the listener becomes a speaker, 
speaking in the second language. 

15 Assume that the information about the direction of the speaker 

and listener, which the sensor 12 has received, is video information. 
In this case, the sensor 12 is a plurality of cameras. As is known 
in the art, the direction of the speaker can be identified by 
performing personal identification on the speaker. Namely, the 

20 images obtained by the sensors 12 are examined, and if these images 
include one showing the speaker, the direction corresponding to 
the image is regarded as the direction of the speaker. 

Generally, it is difficult to determine whether the person 
detected is the speaker or the listener, from the video information 

25 only. It is therefore necessary to detect the direction from audio 
signals or to analyze the motion of the lips, thereby to identify 
the speaker. Details of the person recognition are described in, 
for example, JP-A-2002-56388 (Document D) . The face, the eyes, or 
the lips may be detected in order to recognize or identify the person , 
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instead of detecting the person. 

The sensor 12 can be configured to receive all signals 
represented by electric waves, light, sound, temperature 
(thermography) , contact stimulation, or any possible combination 
5 of these. As sensor 12, a keyboard or a voice recognition device 
can be exemplified as has been explained. 

The direction identifying means 13 may receive not only the 
output of the sensor 12 , but also the timing control signal generated 
by the timing-generating means 63 as input. The timing generating 

10 means 63 can control the timing of identifying the direction from 
the information receivedby the sensor 12 , by using the signal received 
from the timing-generating means 64. If this is the case, the 
direction can be identified at the timing that the input and the 
result of processing are obtained. Thus, smooth communication can 

15 be accomplished on the basis of themore accurate result of identifying 
the direction. 

The overall control unit 6 may control the timing generating 
means 64 , causing the same to generate the timing at which the motion 
controlling means 7 operates, by using the result of processing 

2 0 obtained by the information changing means 5 and the inputs obtained 
from the sensor 2 and sensor 12. In particular, if the sensor 12 
receives video information, the timing generating means 64 uses 
both the audio information it has received from the sensor 2 and 
the video information it has received from the sensor 12 , and generates , 

25 at high accuracy, the timing of operating the motion controlling 
means 7 . Since the timing of operating the motion controlling means 
7 reflects the timing at which the result of processing and the 
information about the direction are obtained, smooth communication 
through various expressions can be accomplished. 
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The transmitting means 14 transmits a specific signal. The 
sensor 12 detects the reflection signal generated at the speaker 
and contained in the signal transmitted from the transmitting means 
14. The direction identifying means 13 analyzes the reflection 
5 signal supplied from the sensor 12 and detects the direction of 
the speaker and listener. The signal that the transmitting means 
14 transmits may be electric waves, light, sound, or the like. Of 
these examples, the most readily usable is ultrasonic waves. Since 
the direction is identified and the timing is generated on the basis 

10 of the reaction to the specific information thus transmitted, smooth 
communication through various expressions can be carried out. The 
transmitting means 14 may be eliminated. 

The direction identifying means 13 includes the selecting means 
131, the identifying means 132 and the repeating control means 133. 

15 The selecting means 131 and the repeating control means 133 may 
be eliminated. 

If the selecting means 131 and the identifying means 132 are 
provided as in this embodiment, the transmitting means 14 transmits 
the specific signal to a limited region. The sensor 11 detects the 

20 reflection signal generated at the speaker and contained in the 
signal transmitted from the transmitting means 14. The direction 
identifying means 13 includes the selecting means 131 and the 
identifying means 132. The selecting means 131 selects only the 
reflection signal included in the signals received by the sensor 

25 12, which corresponds to the region to which the transmitting means 
14 should transmit the specific signal . Using the reflection signal 
thus selected, the identifying means 132 identifies the direction 
of the speaker . 

The sensor 12 receives a signal transmitted from the 
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transmitting means 14 and reflected, in the embodiment described 
above. Even if the sensor 12 receives information not related to 
reflection, the direction of the speaker can be identified on the 
basis of the same principle as described above, as the selecting 
5 means 131 selects a reflection signal and the identifying means 
132 identifies the direction of the speaker from the reflection 
signal selected. 

If constituted by the selecting means 131 and the identifying 
means 132, the direction identifying means 13 can further limit 

10 the region in which information is received to identify the direction . 
Therefore, smooth communication can be achieved on the basis of 
the more accurate result of identifying the direction. 

The repeating control means 133 incorporated in the direction 
identifying means 13 repeatedly identifies the direction of the 

15 speaker with respect to a plurality of regions, by using the signal 
received by the sensor 12, selected by the selecting means 131 and 
identified by the identifying means 132. In this embodiment, the 
identifying means 132 is repeatedly operated to process information 
received from another region. Instead, a plurality of identifying 

2 0 means may be used, and appropriate signals may be distributed to 
these identifying means . Therefore, the directions of the speakers 
existing in the regions can therefore be identified . Hence , a speaker 
whose direction is not known at all can be detected by finding a 
person in a particular region, and a plurality of speakers can be 

25 detected. The repeating control means 133 may be eliminated. 

If the sensor 12 receives video information, it is possible 
to detect the speaker and the direction of the speaker and to acquire 
information representing the direction of a listener by using the 
information representing the direction of the speaker. This is 
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because the speaker usually faces the listener while speaking to 
the listener. The direction of the speaker can be determined by 
detecting the direction of the speaker's body or the direction of 
one's eyes, from video information. The direction to which the 
5 speaker turns is converted, if detected, to a direction of the speaker 
as viewed from the sensor 12, by using the direction information 
of the speaker. The direction information thus obtained is used 
in combination with the information about the direction of the 
listener . The direction of the listener can therefore be identified 
10 more accurately than otherwise. The information obtained by 

converting the orientation of the speaker may of course be used 
as one representing the directionof the listener if it is sufficiently 
accurate . 

Moreover, audio information can be used in this sequence of 
15 operations. If this is the case, the sensor 12 must be of the type 
that can receive both video information and audio information, and 
direction identifying means 13 must have a function of processing 
audio information. The audio information can be the start/end of 
a speech, a switching of languages, personal voice characteristics, 
20 or the like. That is, the function of processing audio information 
includes voice detection, language identification and personal 
identification based on voice. 

In the present embodiment, the region in which to receive 
the information for identifying the direction is limited. This 
25 limited region is changed, thus repeating the identification of 
direction. Therefore, the identification of direction can be not 
only accurate, but also be applied to a plurality of 
speakers/listeners. Smooth communication can therefore be 
accomplished . 
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A sensor 15 receives information about the distance between 
the robot and the speaker and the distance between the robot and 
the listener and transmits this information to the analyzing means 
66. The analyzing means 66 evaluates the information transmitted 
5 from the sensor 15, finds these distances and the listener and 
transmits the distance information to a motion controlling means 
17. The motion controlling means 17 controls the wheels 18 and 19, 
thus moving the robot. The distances between the robot and the 
speaker and the distance between the robot and the listener is thereby 
10 adjusted. 

More specifically, if the information transmitted from the 
analyzing means 66 represents distances that are shorter than a 
predetermined distance , the wheels 18 and 19 are controlled, thereby 
making the robot body approach the speaker or the listener. 

15 Conversely, the information transmitted from the analyzing means 
66 may represent distances that are longer than the predetermined 
distance. If so, the wheels 18 and 19 are controlled, making the 
robot approach the speaker or the listener . The sensor 15 , analyzing 
means 66 and wheels 18 and 19 may be eliminated. 

20 In the embodiment of FIG. 10, the motion controlling means 

16 is configured to control the wheels 18 and 19. The wheels can 
be provided in any desired number. In this embodiment, the 
robot-moving means is constituted by the wheels 18 and 19. 
Nonetheless, the robot-moving means may be constituted by components 

25 other than wheels. Caterpillars or a multi-leg walking mechanism 
can be employed for the robot-moving means . 

The sensor 15 is, for example, an ultrasonic-wave sensor. 
The sensor 15 may be an image sensor that includes an imaging element 
and the analyzing means 66 may be an image-recognizing means . Thus , 
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it is possible to adjust the distance between the robot and the 
speaker and the distance between the robot and the listener based 
on images. In this case, use can be made of various information 
items representing the face (size of the face) , the eyes (their 
5 relative positions or the distance between them) , or the lips (their 
relative positions) , as has been explained in conjunction with the 
sensor 12. 

The sensor 15 may be a plurality of acoustic transducers and 
the analyzing means 66 may be configured to detect the direction 

10 of a sound source. Then, the distance between the robot and the 
speaker and the distance between the robot and the listener can 
be adjusted, using the principle of trigonometrical survey. 
Otherwise, the intensity of sound or light may be utilized. 

The distance between the robot and the speaker and the distance 

15 between the robot and the listener can be controlled in accordance 
with the information supplied from outside. Smooth communication 
through various expressions can therefore be accomplished. 

The sensor 2 may be removed from the robot body 1 and provided 
in a terminal, a communication means may be provided in each of 

2 0 the robot body and the terminal, and the sensor output may be 

transmitted from the terminal to the robot body via a communication 
path. In this configuration, the terminal having the sensor 2 can 
be moved independently of the robot. The sensor 2 can be positioned 
near a signal source. When the sensor 2 receives audio information, 

25 noise mixing into the signal can be relatively small because the 
sensor 2 is positioned near the mouth, i.e., the signal source. 
Therefore, the signal subjected to identifying process has a high 
signal-to-noise ratio, which leads to the improvement of the 
recognition ratio. This serves to achieve smooth communication in 
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high quality. 

As in the embodiment shown in FIG. 6, the sensor 2 , information 
processingmeans 3 and information reproducingmeans 4 can be assigned 
to the parts of information processing. A plurality of robots may 
5 be provided in the terminal that has the sensor 2 . The terminal 
can then make the robots perform the processing in the same manner. 
Thus, appropriate robots can be selected and operated in accordance 
with the languages used and the nature of the information received 
by the sensor 2 . Since the terminal 24 make appropriate communication 
10 with the robots, making them perform parts of the information 

processing, respectively, smooth, high-quality communication can 
be accomplished. 

The sensor 2 , information processing means 3 and information 
reproducing means 4 can be appropriately assigned to the parts of 
15 inf ormationprocessing, in the same way as in the embodiment explained 
with reference to FIG. 6. 

As has been described, the information changing means 5 used 
in this embodiment is configured to add information to, or delete 
information from, the result of information processing, which has 
20 been obtained by the information processing means 3. A value such 
as emotional expression or detailed information can therefore be 
added to the result of the processing that has been correctly carried 
out. This helps to accomplish smooth communication. In addition, 
the present embodiment is configured to calculate automatically 
25 the direction information from the information input from outside. 
Thus, smooth communication through various expressions can be 
accomplished. 
Embodiment 6 

This embodiment is another configuration of a robot. 
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As shown in FIG. 11, this embodiment includes a terminal 24, 
a server 40, a robot 25, and communication paths 85 and 87. The 
terminal 24 includes a sensor 2 and a communication means 211. The 
robot 25 includes a communication means 70 and an information 
5 reproducing means 4. These means operate, generally as will be 
described below. 

The sensor 2 receives information to be processed, such as 
signals supplied from outside. The communication means 211 
transmits the information. that the sensor 2 has received, to a 
10 communication means 402 through the communication path 85. The 
communication means 402 transmits the information received at the 
sensor 2 from the communication means 211 through the communication 
path 85, to the information processing means 3. The information 
processing means 3 processes the information received at the sensor 
15 2, and transmits the information to an information changing means 
401. 

The information changing means 401 adds information to, or 
deletes information from, the result of information processing, 
which has been obtained by the information processing means 3, and 

20 transmits the resulting information to the communication means 402 . 
The communication means 402 transmits the result of processing to 
the communication means 7 0 through the communication path 87. The 
communication means 7 0 transmits the information received via the 
communication means 402, to the information reproducing means 4. 

25 The sensor 2, information processing means 3 and information 

reproducing means 4 can be appropriately assigned to the parts of 
information processing , in the same way as in the embodiment explained 
with reference to FIG. 6. The communication paths 85 and 87 may 
be any means available, such as wired transmission, wireless 
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transmission including wireless LAN , and IP communication via the 
Internet . Further, terminals 22 and 23 can be implemented as portable 
devices such as personal digital assistants (PDAs) , personal 
computers (PCs) or cellular telephones. 
5 The present embodiment shown in FIG. 11 is similar in 

configuration to the embodiment of FIG. 6, except that the function 
of the terminal 21 is removed, that the information-inputting 
function of the sensor 2 is assigned to the terminal 24, and that 
the information reproducing function of the information reproducing 

10 means 4 is assigned to the robot 25. 

The server 40 may be configured to transmit the information 
output from the information changing means 401 , to two or more robots . 
In this configuration, the terminal 24 can make appropriate 
communication with the robots, and the robots can be assigned to 

15 parts of the information processing. Hence, smooth, high-quality 
communication can be accomplished. Since the sensor 2 is provided 
in the terminal 24 and thus arranged near the signal source, the 
rate of recognizing the sound or images can increase. This 
contributes to the smooth, high-quality communication. 

20 In the present embodiment, the information changing means 

5 is configured, as described above , to add information to , or delete 
information from, the result of information processing, which has 
been obtained by the information processing means 3. A value such 
as emotional expression or detailed information can therefore be 

25 added to the result of the processing that has been correctly carried 
out. This helps to accomplish smooth communication. Moreover, the 
processing can be performed at high efficiency to achieve smooth 
communication, because the sever 40 is assigned to a process that 
includes many operations and the terminal 23 is assigned to a 
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small-scale process and interface operations. 

Further, according to the present embodiment, the rate of 
recognizing the sound or images can increase since the sensor 2 
is provided in the terminal 24 and thus arranged near the signal 
5 source. This contributes to the smooth, high-quality 
communication . 
Embodiment 7 

An apparatus using the information processing system described 
with reference to FIGS. 1 to 5 , which represents information such 

10 as an added value in the form of the motion of an agent displayed 
as an image, will be described with reference to FIGS. 12 to 17. 
This apparatus can be used in car navigation, language-teaching 
systems, translating machines, interpretation machines, and the 
like. The functions of the information processing means, 

15 information changingmeans and overall control unit of this apparatus 
can be implemented by the software installed in a personal computer. 
FIG. 12 is a block diagram of an apparatus according to this embodiment, 
illustrating the configuration thereof . FIG . 13 is a diagram showing 
how the user inputs information to the sensor. FIG. 14 is a diagram 

20 depicting an apparatus that uses a microphone as sensor 2. FIGS. 
15 to 17 are diagrams explaining how the agent moves on the information 
reproducing means 4 . 

As shown in FIGS. 12 to 14, the apparatus according this 
embodiment includes a main body 1, a sensor 2 (microphone 21) , an 

25 information processing means 3, an information changing means 5, 
an information reproducing means 6 and an overall control unit 6. 
All of these means, but the information reproducing means 4, operate 
as has been explained above . Only the information reproducing means 
4 will be described. 
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The information reproducing means 4 can be a flat display 
device such as a liquid crystal display or an EL display, a CRT, 
or the like. As FIG. 15 shows, an agent may move up and down on 
the screen 41. As FIG. 16 shows, the agent may move slantwise, while 
5 rotating, on the screen 41 in accordance with added information. 
Further, as FIG. 17 depicts, the agent may move up and down, while 
rotating. So moving, the agent can represent added information. 
In the information reproducing apparatus 4 shown in FIGS. 15 to 
17, the speakers 42 to 45 can generate sound and the screen 41 can 

10 display the agent. As has been explained, the agent can make not 
only motions that man usually does, such as nodding, leaning of 
head, wagging of head and jumping, but also motions that man would 
never make. Examples of the special motions the agent or the robot 
can make are the expansion and contraction of body and separation 

15 of body. Further, the agent can make its copies in large or small 
numbers . 

Further, the speakers 42 to 45 can be used to control the 
position of the audio image. That is, the amplitude and phase of 
the signal supplied to each speaker are controlled appropriately, 
20 thereby controlling the volume of reproduced sound at a specific 
distance and in a specific direction. 

Indus-trial Applicability 

The present invention can be applied to an information 
25 processing system that performs interpretation, translation, 
conversation, voice recognition, voice synthesis, image 
understanding, and the like, to a robot that functions as this system, 
and to a program that enables a computer to work as the system. 
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Brief Description of Drawings 

FIG. 1 is a block diagram illustrating an example of the 
configuration according to a first embodiment of the present 
invention; 

5 FIG. 2 is a block diagram illustrating a first example of 

the information processing means used in the first embodiment of 
the present invention; 

FIG. 3 is a block diagram illustrating a second example of 
the information processing means used in the first embodiment of 
10 the present invention; 

FIG. 4 is a block diagram illustrating a first example of 
the information changing means used in the first embodiment of the 
present invention ; 

FIG. 5 is a block diagram illustrating a first example of 
15 the information changing means used in the first embodiment of the 
present invention ; 

FIG. 6 is a block diagram illustrating an example of the 
configuration according to a second embodiment of the present 
invention ; 

20 FIG. 7 is a block diagram illustrating an example of the 

configuration according to a third embodiment of the present 
invention ; 

FIG. 8 is a block diagram illustrating another example of 
the configuration according to the third embodiment of the present 
2 5 invention ; 

FIG. 9 is a block diagram illustrating an example of the 
configuration according to a fourth embodiment of the present 
invention ; 

FIG. 10 is a block diagram illustrating an example of the 
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configuration according to a fifth embodiment of the present 
invention ; 

FIG. 11 is a block diagram illustrating an example of the 
configuration according to a sixth embodiment of the present 
5 invention; 

FIG. 12 is a block diagram illustrating an example of the 
configuration according to a seventh embodiment of the present 
invention ; 

FIG. 13 is a diagram explaining how the user of the apparatus 
10 inputs information to the sensor; 

FIG. 14 is a diagram illustrating the apparatus that uses 
a microphone as sensor 2 ; 

FIG. 15 is a diagram explaining how the agent moves on the 
information reproducing means 4 ; 
15 FIG. 16 is another diagram explaining how the agent moves 

on the information reproducing means 4 ; and 

FIG. 17 is a further diagram explaining how the agent moves on the 
information reproducing means 4 . 
Explanation of references 
20 1: Robot body (Main body) 

2, 12, 15: Sensor 
21, 24: Terminal 

211, 402, 70: Communication means 

3: Information processing means 

25 31, 53, 544: Control unit 
32: Voice processing unit 
321: Voice recognition unit 
322: voice synthesis unit 
33: Automatic interpretation unit 
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4 : Information reproducing means 

40: Server 
25: Robot 

5, 401: Information changing means 

5 50: Network 

51: Change processing unit 

52 : Additional-information generating unit 

521: Information analysis unit 

522, 542: Retrieval unit 
10 523, 543, 72: Memory 

524, 541: Communication means 

54: Additional-information storage unit 

6: Overall control unit 

62 : Evaluating means 
15 63, 64: Timing generating means 

7, 17: Motion controlling means 

71, 171: Motor 

80, 85, 87: Communication path 

11, 14: Transmitting means 

20 10: Direction inputting means 

13: Direction identifying means 

131: Selecting means 

132: Identifying means 

133: Repeating control means 
25 18, 19: Wheel 



